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ABERRAKT, EPIDERMAL GROWTH FACTOR RECEPTOR DNA, 
RMA AKD PROTEIN FORMS AND METHOD 

This invention concerns aberrant DNA, RNA and protein 
fonns of epidermal growth factor receptor. These forms can 
be used as tumor markers, used to assay effects of therapy 
or as therapy guidelines and to classify tumor types. These 
tumor types can then be used for further therapy guidelines. 

Description of the Dravipg 
Fig. 1 a) EGFR cDNA {3816bp coding sequence) is 
described with respect to the regions coding for 

2 6 

Structural/ functional domains of corresponding protein . 
SP: signal peptide TM: transmembrane. 

b) Enlargement of the first 1500 bases showing 
positions and sizes of the exons and introns 1 through 7. 
Shaded region: coding sequences eliminated from aberrant 
transcripts in the six glioblastomas examined. Locations of 
oligonucleotides used in the study are indicated, Nximerical 
sequence identities of oligonucleotides (5' to 3') based 
upon Ullrich et al. 1984^S PC46=50-1; PC56-1-50; PC66=172- 
193; PC15=428-"379; PC29=914-865 ; PC83=979-1000 ; PC84=1000- 
1021; PC88=1054-1075; PC89=1097-1076 ; PC85=1099-1078 ; 
PC77=1167-1146; PC59=1250-1201; PC67-1356-1335 ; PC58=1490" 
1441. In addition a cDNA probe, pE7 (including 
approximately bases 650-3000^*) , was used on Sad digested 
DNA for screening for aberrant restriction fragments (RF) in 
the tximors. oligonucleotides used to examine amplified 
genes 3' of the deleted region (not shown; all anti-sense): 
PC54=1850-1801; PC17=2100-2051; PC64-2549-2500 ; PC63=2899- 
2850; PC47=3299-3250; PC34=3699-3650 • 

Fig. 2. Genomic analysis of EGFR rearrangements. 
Tumor (T) DNA from three patients, P^-^s were hybridized with 
pE7 and PC59 (see Fig. 1) . A normal Sacl restriction 
fragment (RF) pattern resulting from hybridization of pE7 is 
displayed for P^. Hybridization of pE7 to each of the tumor 
DNAs revealed a relative depletion of the normal lOkb Sad 
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RF. The l.751cb RF identified by pE7 in normal DNA is also 
identified by PCS 9. For tumor DNAs from P^ and P^, 
hybridization with PC59 revealed additional, t\2mor specific, 
RF's of 2.4kb (Pj,) and 6*5kb (P^) . These aberrant bands were 
also revealed by pE7 hybridization, although the aberrant 
6.5 kb RF of P^ was somewhat obscured by a normal RF of 
similar size. These aberrant RF's result from the deletion . 
rearrangement elimination of the Sad site defining the 5' 
and of 1.75kb fragment, located in intron 7. Patient P3 
showed no rearrangement of the 1.75kb fragment. However, 
use of an intron 7 probe, Int7 (consists of 22 bases of exon 
7, all of intron 7, and 24 bases of exon 8) revealed, in 
addition to the normal 1.75kb and 3kb RF, an aberrant 5kb RF 
indicating that the 3' end of the rearrangement in this 
tximor occurred 5' of the Sad site in intron 7. 
METHODS: DNA isolation, digestion, Southern blotting and 
radiolabelling of pE7 and Int7 by random oligonucleotide 
priming, as previously described 21^27^ Radiolabelling of 
oligonucleotide probes by 3 'extension (terminal transferase, 
'''^^PdCTP) , Int7 produced by PGR exon-exon connection 
strategy with oligoprimers PC84 and PCS 5 from normal DNA. 

Fig. 3 a) An autoradiogram composite indicating 
aberrant EGFR transcripts in glioblastomas. Upper part 
constructed by surprimposing horizontally displaced 
autoradiograuas (open lane between samples) resulting from 
hybridization of PC29 and rehybridization of PC59 to the 
same filter. The lower portion represents an autoradiogram 
resulting from rehybridization of a GAPDH oligonucleotide 
probe (50bp) to the same filter, to check sample quantity. 
A431 cells which have amplified EGFR and show a lOkb and a 
2.9kb transcript " were included as a control. P^ shows 
expression of a normal lOkb transcript when probed with PC29 
(only visible at long exposures; arrow) as well as an 
aberrant, over expressed, shortened transcript which is 
detected with PC59.^ P^ shows an overexpressed 10 kb 
transcript as well as the same phenomenon as P.. 
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METHODS: Total RNA isolated from frozen tiimor tissue by 
polytrone homogenization in guanidine isothiocyanate buffer 
followed by ultracentrifugation on a CsCl gradient. Twenty 
/ig RNA was electrophoresed on a denaturing 1% agarose gel, 
blotted to Hybond membrane and hybridized to 
radiolabelled synthetic oligonucleotides (see Fig. 2). 

b) Ethidium bromide stained gel of the products 
from PCR amplification of EGFR cDNA with primer pair PC66 
and PC67 (Fig. 1). Lanes 1 and 10, 123bp ladder; lanes 2 
and 3, product from two specimens of non-neoplastic brain, 
removed in the surgical treatment of epilepsy; lanes 4-9, 
products from the six tumors Pi-Pg* 

METHODS: Total RNA isolated as in Fig. 3a, Single stranded 
cDNA produced using MoMuLV reverse transcriptase and random 
priming with hexanucleotides PCR reaction: 3 0 cycles; 

94°C/1 min.; 55°C/1 min. ; 72°C/3 min- + 10 sec/cycle, last 

cycle 10 min. 

c) Southern blot of b) probed with PC59 (see Fig. 
1) . In autoradiograms exposed longer the normal 1185bp band 
could be shown in all samples. 

d) Southern blot of b) probed with PC2 9 (see Fig. 
1) . Hybridizing with PC15 (not shown) gave similar results 
(in autoradiograms exposed longer the 1185bp band could be 
shown in all samples) . 

METHODS: c) and d) alkali blotting to Gene-Screen-Plus*^ 
nylon membranes, oligonucleotides radiolabelled as for Fig. 
2. 

Fig. 4. Autoradiogram of a sequence gel (reading 
anti-sense) from P^ (left) and P^ (right). A comparison of 
this sequence with that reported for the first exon of 

EGFR shows that the last 70 nucleotides (only 2 6 shown) 
before the 5' splice site, in the sense sequence are 
identical to the last 70 nucleotides in exon 1. The 
aberrant splicing (splice point arrowed) occurs after the 
first base in a condon (last base in exon 1) and continues 
with the last two bases from another condon (first two bases 
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of xon B, see text), thus th rearranged mRNA remains in- 
frame with the loss of 267 condons and the production of a 
new codon'"GGT" (Gly) . The aberrant mRNA codes for the 
normal signal peptide and a truncated EGFR consisting of the 
first five N-terminal amino-acids, the loss of the following 
267 amino-acids, the addition of a glycine, and then remains 
in-frame reading from condon 274. 

METHODS: Double stranded cDNA was produced and amplified as 
described in Fig- 3 and isolated from a 1% agarose gel using 
Gene-Clean^. cDNA. (1.5 ng) was then used in a unbalanced 
PGR reaction (30 cycles; 94^C/1 min.; 55°C/l-min.; 72°C/3 
min. + 10 sec. /cycle, last cycle, 10 min. Primers: PC66, 
50pM; PC67, l,5pM) to produce a sense single stranded (ss) 
cDNA template which was isolated from a 1% agarose gel by 
f reeze-thawing. Approximately lOOng of this sscDNA was then 
primed with PC77 (0.6pM) and chain termination sequenced 
using a Sequence^ kit (U.S. Biochem. Corp.) according to the 
manufacturers recommendations. 

Description of the Invention 

Several molecular anomalies have been documented in 
glioblastoma and maybe responsible for their malignant 
phenotype. Amplification, overexpression and rearrangements 
of EGFR have been reported in glioblastomas and studies of 
glioma cell lines have suggested the possibility of auto- or 
paracine loops involving TGF a/EGFR in these tumors. We^ 
have studied gene dose, gene structure as well as the 
structure and level of mRNA transcripts for EGFR, TGFa and 
EGF in a series of 30 primary human glioblastomas. DNA and 
RNA was studied on Southern and Northern blots respectively 
using anti-sense synthetic oligonucleotides to different 
parts of the EGFR, EGF and TGFa genes. For detailed 
studies, dscDNA was randomly produced from total RNA by 
reverse transcription followed by specific PCR 
amplification. Unbalanced PCJR was used to produce sscDNA 
templates for dideoxy sequencing. 

The gene for EGFR was amplified between 9-112 times 
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in just ver 50% of the tumors. Structural abnormalities in 
the amplicon were detected in 43% of the amplified cases. 
Five of six aberrantly amplified cases showed truncated 
transcripts on Northern blots. Aberrant transcripts were 
also found in cases with apparently normal restriction 
fragments of the gene as well as in tumors with non- 
amplified normal genes. The structural abnormalities 
clustered in either the extracellular or intracellular 
domains. .PGR amplification and nucleotide sequencing of 
t\amor mRNA revealed an identical loss of sequence affecting 
the extracellular domain in 5 studied cases. In addition, 
we found expression of mRNA for EGF at low but detectable 
levels and/or TGFa at varying levels in all glioblastomas 
studied. 

Glioblastomas may have an amplified, overexpressed, 
and in a large percentage of cases a structurally altered 
EGFR. They also express either EGF or TGFa or both. The 
aberrations of EGFR are commonly found in the extracellular 
region and are identical in many cases. The truncated 
receptor may function analogously to the erb-B gene product. 

The epidermal growth factor receptor (EGFR) gene has 
been found to be amplified and rearranged in human 
glioblastomas in vivo However, no sequence details of 

such rearrangements have been published. Here we present 
the sequence across a novel splice junction of aberrant EGFR 
transcripts, derived from corresponding and uniquely 
rearranged genes, which are co-amplied and co-expressed with 
non-rearranged EGFR genes in six, primary, human 
glioblastomas. Each of the six cases examined reveals 
aberrant transcripts derived from identical splicing of exon 
1 to exon 8 as a consequence of a deletion-rearrangement of 
the amplified gene, the extent of which is variable amongst 
these tumors. In spite of their intertumoral variability, 
these intragenic rearrangements result in the loss of 801 
coding bases (exons 2 through 7) and creation of a new codon 
at the novel splice site in their corresponding transcripts. 
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The mRNA sequence for the signal peptide, the first five 
codons, and the reading frame 3' of the rearrangement, are 
intact. 

The EGFR protein is a 170kD transmembrane 
glycoprotein found on many normal and malignant cells ^' 

. The extracellular binding of one of its two known 
endogenous ligands, epidermal growth factor and transforming 
growth factor a, results in conf onaational changes of the 
extracellular domain the activation of the receptor's 
intracellular tyrosine kinase activity , and the 

stimulation of DNA synthesis. A constitutive ly activated, 
and cell transforming variant of this receptor, with most of 
the extracellular domain deleted and further car boxy terminal 
deletions and mutations, is encoded by the v-erb B oncogene 
of avian erythroblastosis virus 

EGFR gene amplification is exclusively seen in the 
most malignant variants of gliomas, especially the 
glioblastomas Studies of DNA, mRNA, and protein from 

primary human glioblastomas and xenografted glioblastomas 
with EGFR amplification, indicate the existence of truncated 
EGF receptors in such tumors Most results implicate the 

loss of coding sequences for the extracellular domain 



We have studied the DNA and RNA from primary tumor 
tissue from six patients with brain tumors 

histopathologically classified as glioblastomas where 
the tumors were determined to have amplification of the EGFR 
gene Using a series of EGFR mRNA complementary 

oligonucleotide probes (see Fig. 1) on Southern blots, it 
was possible to determine a region of the gene which, 
amongst these tumors, displayed a relatively reduced level 
of amplification (sequences complementary to PC15 and PC29; 
see, for example, the lOkb (recognized by PC15) restriction 
fragment (RF) in the tumor DNA of patient P^^, see Fig. 2). 
According to Haley et al. this region lies between intron 
1 and intron 7. Use of the oligoprobe PC59, whose sequence 
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corresponds to the 3' end of exon 8 (Fig. 1), revealed an 
amplified, tumor DNA specific, rearranged Sad RF in some of 
the cases (see P^, Pj in Fig. 2). Intertumoral variable 
amounts (Fig. 2) . of an amplified normal 1.75kb FR (SacI 
sites in introns 7 and 8) were present in addition to the 
aberrant RFs, and the ratio of the amplified rearranged RF 
was unique to each tumor. In contrast some patients, 
including patient P3 (Fig. 2) showed no rearrangement of the 
1.75kb SacI RF. However, for patient P3, a rearrangement of 
the contiguous 5' normal 3)cb RF, identified by Int7 [see 
Fig. 2 for details] and PC29, was identified. These data 
demonstrate the variability in genomic location of the 3' 
end of the intragenic deletion-rearrangement. Since the 
variation in the sizes of the aberrant fragments detected by 
PC59 in P^ and P^ (4.1}cb) cannot be accounted for by 
differences in the position of the rearrangement site in the 
1.75)cb normal fragment, these data indicate variability of 
the 5' end of the rearrangement as well. 

The Northern blot analysis showed variable 
expression of the normal lOkb transcript in all cases. In 
addition, overexpressed aberrant transcripts could be 
detected in at least five of the cases, two examples of 
which are shown in Fig. 3a. The aberrant transcripts 
hybridized to all antisense oligonucleotides (Fig. 1) except 
PC29 (Fig. 3a) and PC15 (not shown). As will be shown 
below, the aberrant transcripts lacked an internal stretch 
of 801 bases. 

To determine the consequences of these 
rearrangements on the EGFR xnRNA, a pair of oligonucleotide 
primers containing sense sequences 5' (exon 1) and antisense 
sequences 3' (exon 8) of the deleted region were used for 
the PCR amplification of EGFR cDNA from each tumor. This 
procedure should normally result in the amplification of a 
liaSbp DNA fragment (bases 172-1356 of laRNA) . However, in 
all cases an abnormal fragment of appox. 380bp, in addition 
to the normal 1185bp fragment were observed (Fig. 3c). The 
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yield of the latter band varied between tumors. As 
anticipated, oligoprobes PC15 and PC29 hybridized to the 
normal fragment, but not to the shorter, aberrant fragment 
(Fig. 3d) whereas PC59 detected both fragments (Fig. 3c) . 
We thus concluded that this 380 bp fragment includes the 
flanking coding sequences on either side of the deletion, 
and that all the tumors appeared to have lost approximately* 
the same number- of bases from the aberrant EGFR transcript • 
Sequencing of the this 380bp fragment (Fig. 4) revealed that 
the last nucleotide of the first exon (base 274) had 
been spliced to base 1076 in each of the six tumors. This 
novel splice results in the creation of a glycine codon 
(GGT) at the sixth codon position and the in-frame 
apposition of what would normally be codon 273 to the 
seventh codon position of the aberrant transcript. 

Our results indicated that these aberrant 
transcripts contain the entire exon i 24, 25 followed by a 
sequence starting at base 52 of the 150 bases long exon 8 
To clarify this further, the position and size of intron 7 
was determined by using an exon-exon connection strategy. A 
contiguous primer pair, one ending at sence base 1075 
(hypothesized as being the last nucleotide in exon 7) (PC88) 
and the second ending at anti-sense-base 107 6 (hypothesized 
as being the first nucleotide in exon 8) (PC89), was used to 
PGR amplify non-coding DNA which might interrupt these 
sequences. The PGR product using these primers on two 
different normal DNA templates (white blood cells) was 
approximately 1780bp (data not shown) . This result is 
consistent with the hypothesis that intron 7 lists between 
bases 1075 and 1076 and thus the reported 3' and 5' ends of 
introns 7 and 8 respectively are incorrect. To support 
the EGFR-gene authenticity of this fragment, 5' sense and 
3'antisense primers, outside PC88 auid PC89, were substituted 
for one or the other of the original primers to amplify a 
slightly larger ger^omic fragment containing known coding 
sequences (approx. 1850bp) . As expected, the primers 
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internal to the primers used, hybridized to the amplified 
product in each case. On the basis of these data, we 
conclude that the sequence observed in the aberrant mRNA 
transcripts results from the splicing of exon 1 to exon 8. 
This would explain how the internal deletion in the mRNA 
transcript is always the same, despite the fact that the 
sites of the genomic deletion-rearrangement varies in 
individual tumors (Fig. 2). 

It is noteworthy that the aberrant EGFR 
gene/ transcript has only been found in tumors also having a 
non-rearranged amplified gene and therefore would presumably 
arise during the amplification process;, in 66 gliomas we 
have studied, no gross genomic or transcript rearrangement 
has been observed in the absence of amplification (Ekstrand 
J. in preparation) . Tumors with an amplified, rearranged 
gene co-express normal and aberrant transcripts. Similar 
rearrangements may also occur in the other tumor types which 
show amplification of the EGFR gene ""^^ The finding of 
identical abnormal splicing of exon 1 to 8 in the primary 
glioblastomas of six different patients suggests that the 
corresponding truncated EGFR has biological significance. 
An identical deletion in the EGFR transcript has been found 
in two xenografts of glioblastomas. 

The coding region lost is the N-terminal, cysteine 
rich domain, with possible involvement of the N-terminal 
portion of the ligand binding domain. The presence of the 
signal peptide with five N-terminal amino acids and in- 
frame coding sequence 3' of the rearrangement should ensure 
proper sorting and membrane insertion, respectively, of the 
aberrant protein • It seems an unlikely coincidence that the 
801 bases deleted from the transcripts reported here would 
account for the molecular weight differential observed 
between the normal 170kD receptor and the aberrant l4QkD 
receptor reported in two xenografted glioblastomas ^. Cell 
membrane preparations from the two xenografts containing the 
aberrant 140kD EGFR protein showed a significant elevation 
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Of tyrosine kinase activity in the absence of ligand ^' This 
suggests an aberrant EGFR functionally analogous to the 
constitutively kinase-active retroviral homologue of the 
EGFR protooncogene ; v-erb B, the expression of vhich can 
transform, for example, fibroblasts 
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1. Aberrant EGFR cDNA. 

2- Aberrant, rearranged EGFR cDNA capable of co- 
amplification and co-expression with non-rearranged cDNA in 
tumors • 

3. The cDNA of claim 2 wherein the tumor is a 
glioblastoma. 

4* The cDNA of claim 2 or 3 resulting in loss of 801 
coding bases comprising exons 2-7 with a resulting new codon 
at a novel splice site wherein the EGFR cDNA for the mRNA 
sequence for the signal peptide, the first five codons and 
the reading frame 3' of the rearrangement remain intact. 

5. Oligonucleotide probes for normal and aberrant EGFR 
comprising at least one selected from the group consisting 
of Int7, PC46, PC56, PC66, PC15, PC29, PC83, PC84, PC88, 
PCB9, PC85, PC77, PC59, PC67, PC58, PE7, PC54 , PC17 , PC64 , 
PC63, PC47 and PC34. 

6. Aberrant mRNA expressing protein coded for by the 
cDNA of claim 1 , 2 , 3 or 4 . 

7. The aberrant, rearranged EGFR protein resulting from 
expression of the cDNA of claim 1, 2, 3 or 4 ► 

8. The cDNA of claim 4 wherein the new splice site 
codon codes for glycine. 

9. The cDNA of claim 2 or 3 expressing aberrant EGFR 
transcripts comprising exon 1 followed by a sequence 
starting at base 52 of the 150 base long exon 8. 

10. Monoclonal antibody resulting from the protein 
antigen of claim 7. 
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